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Abstract 

Algorithms for persistent homology and zigzag persistent homology are well studied for homology 
modules where homomorphisms are induced by inclusion maps. However, the same is not true for ho- 
bJQ[ momorphisms induced by other continuous maps such as simplicial ones. In this paper, we propose a 

practical algorithm for computing persistence and zigzag persistence with homology under Z2 coeffi- 
cients for a sequence of general simplicial maps. We leverage the fact that every simplicial map can be 
simulated by inclusion maps, but not necessarily in monotone direction. This helps to convert a (possi- 
bly zigzag) filtration induced by simplicial maps into another zigzag filtration induced by only inclusion 
maps, and the two nitrations share the same persistence diagrams information. Furthermore, the persis- 
tent homology for a non-zigzag filtration connected by simplicial maps can be directly computed using 
the recently introduced concept of annotations. The maintenance of a consistent annotation implies the 
maintenance of a consistent cohomology basis, which by duality, also implies a consistent homology 
basis. This leads to an improved algorithm for computing persistence homology induced by simplicial 
maps. With this new tool, we also provide an alternative way to approximate the persistence diagram of 
a filtration of Rips complexes where vertex collapses are used to tame the blow-up in size. 
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1 Introduction 



Topological persistence was originally defined and explored for nested sequence of simplicial complexes |[T3l 
which induced homomorphisms among their homology groups under inclusions. Later, further develop- 
ments considered persistent homology and zigzag persistent homology for general homomorphisms 171 [181. 
Although persistent homology and zigzag persistent homology under inclusions have been investigated from 
algorithmic view point quite elaborately and successfully (5J|8j[T0l[T3l[T6l, the same is not true for general 
homomoiphisms. Except for the work of j7J, little is known about it. In 0, Carlsson and de Silva presented 
a generic prototype algorithm for computing zigzag persistence homology for general homomorphisms that 
requires an explicit representation of the homomorphisms between the homology groups of two consecutive 
complexes in the sequence. In particular, if the input is given in terms of maps between input spaces such 
as a continuous map / : Ki — > -fQ+i between two consecutive complexes in a given filtration, a represen- 
tation of the induced homomorphism /* : H*{Ki) — > H*(Ki + i) between the homology groups needs to be 
computed. This map, often represented with a matrix, can be obtained by first computing a basis of H*(Ki) 
and .ff*(i<Q + i) and then a correspondence between these bases by interpreting given representation of /. 
Often this step is costly and in general, this eventually leads to 0(n 4 ) algorithm where each input complex 
has 0(n) simplices. In contrast, when the map / is an inclusion, the persistence algorithm can compute the 
persistent homology in 0(n 3 ) time where n is the total number of simplices inserted. 

In this paper, we consider filtrations connected by arbitrary simplicial maps. We show in Section |2] that 
we can compute the zigzag persistence homology for this case with a running time matching that for the 
inclusion-induced filtrations. Since any continuous map between simplicial complexes can be realized by 
simplicial maps, this embraces generality. We first show that any simplicial map / : K. t — > K: l+ i can be 
simulated by a zigzag map involving only inclusions Ki <— K <-^> -fQ+i where K is an intermediate com- 
plex built from K{. This converts the input zigzag filtration connected by simplicial maps to another zigzag 
filtration but connected only by inclusion maps, and it turns out that these two filtrations share the same 
persistence diagrams ifTTl . We can then take advantage of efficient algorithms to compute the persistence 
diagram for an inclusion-induced zigzag filtration 00 [161 to obtain the persistence diagram for the more 
general zigzag filtrations connected by simplicial maps within the same time complexity. 

Next, in Section [3l we further improve the above method for input filtrations that do not zigzag (i.e, all 
simplicial maps are of the form / : Ki — > Ki + {). In Section |2l the construction of K from Ki involves 
inserting simplices that are in the cone of a vertex to the closed star of another vertex. We show that simpli- 
cial maps can be handled directly (without the need of introducing K) by availing the notion of annotations 
introduced in |2). The annotations are linked with cohomology basis which by duality corresponds to a 
homology basis. Thus, maintaining annotations over insertions and edge collapses allows us to maintain 
a consistent homology basis indirectly under simplicial maps and infer the homology persistence. It turns 
out that maintenance of annotations is not only amenable to inclusions, but also to vertex collapses that are 
key to simulating simplicial maps. Although the new algorithm has the same asymptotic time complexity as 
the one described above, it should be much more efficient in practice. This is because it avoids the detour 
through K, and thus requires many fewer operations to move from Ki to Ki + \. 

Finally, we show that our algorithm from Section [3] can help approximate the persistence diagram of 
a Vietoris-Rips (Rips in short) filtration. Although Rips complex is a versatile data structure for topology 
inference, its size has been a bottleneck for computations. Recently, Sheehy iflTI initiated the problem of 
approximating the persistence diagram of the Rips filtration of n points by another filtration where each 
complex has size linear in n. His novel approach allows vertices to be collapsed (deleted) when the param- 
eter r for the Rips complex grows so as to keep the size of the Rips complex small. Sheehy then developed 
an elegant weighting scheme to replace the simplicial maps induced by vertex collapses with a filtration 
connected by inclusions, so that classical algorithms such as |[T3l can be employed. Our algorithm for 
computing persistence of simplicial maps obviates the necessity of the weighting scheme. 
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2 Simplicial maps 



Definition 2.1 Given a finite set V, a simplicial complex K = K(V) is defined as a collection of subsets 
{a C V} so that a € K implies that any subset a' C a is in K. We call V the vertex set of K and the 
elements of K its simplices. An element a £ K is a p-simplex if its cardinality is p + 1. A simplex a' is a 
face of a and a is a coface of a' if a' C a. 

Definition 2.2 Let Xbea subset of a simplicial complex K. The set St X := {a' £ K \ 3a € X and a C a'} 
is called the star of X. The closure of X, denoted X, is the simplicial complex formed by simplices in X 
and all of their faces. The link of X is hk X := St X \ St X . 

The star of X consists of the set of simplices in K that are cofaces of a simplex in X. The link of X is 
the collection of simplices that are the faces of the simplices in the star of X but are not in the star of the 
closure of X. 

Definition 2.3 A map f : K — >• K' is called simplicial if for every simplex a € K, f(a) is a simplex in K'. 
Every simplicial map induces a vertex map, namely the restriction fyofftoV = V(K), but the converse 
is not necessarily true. 

Definition 2.4 A simplicial map f : K — > K' is called elementary if the induced vertex map fy is injective 
everywhere except possibly on a set X C V(K), \X\ > 1, for which fy{X) is a single vertex in K'. 

• If X is empty, f is an inclusion. Furthermore, if K' has exactly one more simplex than K, it is called 
an elementary inclusion. 

• If f is surjective but not injective, f is a collapse. If in addition we also have that \X\ = 2, then it 
is called an elementary collapse. An elementary collapse involves a pair of vertices (n, v) where the 
vertex map collapses them into a single vertex, and is injective everywhere else. 

First we establish that any simplicial map can be built as a composition of elementary simplicial maps 
(Proof is in the Appendix). 

Proposition 2.5 If f : K — s> K' is a simplicial map, then there are elementary simplicial maps fi 

K h Ki H K 2 ■ ■ ■ h K n = K' 

so that f = f n ° fn-i ° ' ' ' ° /l- Furthermore, one may take each fi as an elementary collapse or an 
elementary inclusion. 

In view of Proposition 12.51 it is sufficient to show how one can extend the persistence algorithm for an 
elementary simplicial map. At this point, we make an innocuous change in the definition 12.41 of elemen- 
tary simplicial maps that eases further discussions. We let fy to be identity (which is an injective map) 
everywhere except possibly on a set X C V{K) for which fy(X) is a single vertex fy(v) = v in K'. By 
renaming the vertices in K' that are mapped onto injectively, this change becomes harmless. Specifically, 
an elementary collapse merges two vertices (u, v) into one of them, say to u. 

Now we focus on elementary collapses since the standard persistence algorithm handles inclusions. So, 
let / : K — > K' be an elementary collapse. Assume that the induced vertex map collapses vertices u,v G K 
to u € K' , and is identity on other vertices. For a subcomplex X C K, the cone u * X is the complex 
{a U {u}\a £ X}. Consider the augmented complex 

K := K U u*Stv. 
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In other words, starting from K, for every simplex {uo, . . . , Ud} € St v, we add the simplex {uq, . . . , Ud] U 
{it} to K if it is not already in. In Figure [T] u * St v consists of all shaded triangles and their faces in K. 



K 




Claim 2.6 K' C K (Proof in Appendix). 

Now consider the canonical inclusions i : K ^ K and il : K' <-} K. We want to 
show that the diagram on the left commutes at the homology level. We use the notion 
of contiguous maps which induces equal maps at the homology level. Two maps f\ : 
K\ — > K2, fi ■ K\ -4 K2 are contiguous if for every simplex a 6 K\, /i(cr) U /2(c) is 
a simplex in K 2 . Simple observations lead to the following. 

Proposition 2.7 The simplicial maps i' o / and i are contiguous (Proof in Appendix). 
Next we show that i' induces an isomorphism at the homology level. 

Proposition 2.8 : H*(K') — > H*(K) is an isomorphism. 

Proof: Consider the projection map n : K K' induced by the vertex map 



7t(» 



if p = v 
otherwise. 



Let idf> denote the identity map on K. We show that: (i) it is an elementary collapse, and (ii) the composition 
i' o 7T and id^ are contiguous. It is easy to see that tt o i' is idx'- Then, i' is a (simplicial) homotopy 
equivalence and hence i'^ is an isomorphism. Complete proof appears in Appendix. ■ 
Now we can claim that the persistence for the map / : K — > K' is same as the zigzag persistence for 

i ~ i' . 

the maps K "—t K -f^ K . This boils down to showing the following: 

Proposition 2.9 /„ : H*(K) -4 H*(K') is equal to (z'J- 1 o i, where H*(K) h H*(K) £ H*(K'). 

Proof: Since i is contiguous to z' o / (Proposition 12.71) . we have = («' o /)* = i'^ o / # . Since i'^ is an 
isomorphism, exists and is an isomorphism. It then follows that /* = (i*)^ 1 o i*. ■ 

Proposition 12.91 allows us to simulate the zigzag persistence of a sequence of simplicial maps with 
inclusions alone. From now on, we assume that the homology groups are denned with coefficients in Z2, 
and thus are vector spaces. 

2.1 Zigzag persistence 

Consider the following sequence of simplicial complexes connected with simplicial maps: 

Kx h K 2 h ■ ■ ■ f & K n 

where the arrows <R- can either be a forward arrow — > or a backward arrow When the maps /j are all 
inclusions, it is known that the zigzag persistence induced by them can be computed in matrix multiplication 
time by a recent algorithm of |[T6l . However, this algorithm does not extend to simplicial maps as per se. 
With our observation that every simplicial map can be simulated with inclusion maps, we can take advantage 
of the algorithm of |[T6l even for computing zigzag persistence for simplicial maps. For specificity, assume 
that we have a zigzag sequence: 

Kl %K 2 £-K 3 $ >K n 
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which generates the module 

T : H*{K X ) ^ H*(K 2 ) £-* H*(K 3 ) % ► H*(K n ) 

In view of Proposition I2.9[ consider the following sequence connected only with inclusions: 

K\ if i K 2 ^-)> K 3 <r^ K 3 • • • ^ if n 

At the homology level we have H*{Ki) ~ H*{Ki + \) induced by inclusion Ki -fQ+i and also 
H*{Ki) ~ H^Ki+i) induced by inclusion ifj <— > Ki+i. Thus we have the following persistence module: 

M : H # {K X ) 4 £T,(#i) ~ F*(K 2 ) ~ iJ*(i? 3 ) £ H*(K 3 ) 4 • • • £■ H*(K n ) 

Theorem 2.10 77?e zigzag persistence of J- can be inferred from the zigzag persistence of the module M. 
Proof: Consider the following diagrams between vector spaces: 

H*(K\) -J-?*. H *(K 2 ) — ^ H,(lf 2 ) ff*(iiC 2 ) iT*(if 3 ) — i?*(i^m) 

H*(K{)-^*- #*(if 2 ) -^U- H*(K 3 ) H,(K 3 ) — H*(K m ) 

All the isomorphisms are induced by inclusions, hence every square involving only isomorphisms com- 
mutes. The commutativity of the squares involving follows from Proposition 12.71 Hence every square in 
this diagram commutes, and the top and the bottom sequences produce the same zigzag persistence. ■ 

3 Simplicial maps and annotations 

In this section we present an algorithm that avoids constructing the intermediate complex K for computing 
the persistence for a non-zigzag sequence of simplicial maps. In this new approach, we may only need to 
consider some missing simplices containing both vertices involved in a collapse. As Figure [T] illustrates, 
these simplices are generally far fewer than the intermediate complex K demands. As we see later, annota- 
tions provide a natural means to handle vertex collapses essential for simulating simplicial maps. The reader 
will need the understanding of cochains, cocycles, and cohomology groups, denoted H P (K), of a simplicial 
complex K to follow the further developments (refer to Hatcher lfT5l ). These are duals to chains, cycles, and 
homology groups H P (K) respectively. Again, we assume that all are defined with coefficients in the finite 
field Z 2 . 

Suppose we are given a sequence of simplicial complexes connected by simplicial maps: 

K x hK 2 hK 3 ..-HK n 

Consider the map : Ki — > Kj where = /,o f i+1 ■ ■ ■ o The persistent homology hij is given by 
the image of induced homomorphism hij : H*(Ki) — > H*(Kj). To compute the persistent homology, the 
persistence algorithm essentially maintains a consistent basis by computing the image hij(Bi) of a basis Bi 
of H*{Ki). Here, instead, we maintain a consistent cohomology basis, that is, if B i is a cohomology basis 

of H*{Ki) maintained by the algorithm, we compute the preimage chJ^B*) where H*(Ki) H*(Kj) 
is the homomorphism induced in the cohomology groups by fy. By duality, this implicitly maintains a 
consistent homology basis and thus captures all information about persistent homology as well. 
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Figure 1: Annotation vs. coning: The pair (u,v) is collapsed to u in K to produce K 1 (middle column). 
The 2-simplices are the shaded triangles alone. Annotation requires inserting (implicitly) the single triangle 
as shown on the left whereas K requires inserting many more simplices as shown on the right. 

Our main tool to maintain a consistent cohomology basis is the notion of annotation introduced in 
recently. The annotations are binary vectors assigned to simplices which are linked to a cohomology basis. 
We maintain the annotations as we go forward through the given sequence, and thus maintain a cohomology 
basis and noting each time the creation and destruction of them for computation of persistence. Now we 
detail on annotations and its relation to cohomology. 

Definition 3.1 Given a simplicial complex K, Let K p denote the set of p-simplices in K. An annotation 
for K p is an assignment a : KP — > of a binary vector a CT = a(cr) of same length g for each p-simplex 
a G K. We also have an induced annotation for any p-chain c p given by a Cp = S CTgCp a,j 

Definition 3.2 An annotation a : K p — > 7L 9 2 is valid if: 

1. g = rank H p (K), and 

2. two p-cycles z\ and Z2 have a Zl = a Z2 iff their homology classes are identical, i.e. [zi] = [22]- 
Proposition 3.3 The following two statements are equivalent: 

1. An annotation a : K p — > is valid 

2. The cochains {4>i}i=i t ... t9 given by 4>i(a) = a a [i]for all a G K p are cocycles whose cohomology 
classes constitute a basis of H P (K). 

Proof: 1 — > 2: The cochains (pi sue cocycles since for any (p + l)-simplex r G K p+1 one has [dr] = [0] 
and hence (t>i{dr) = <fti(0) = 0. Let [zi], [z?], • • • , [z g ] be a basis of H p (K). Let V be the vector space 
generated by [<pi], i = 1, ■ ■ ■ ,g. Define a bilinear form a : V x H P {K) — > Z>2 by a([^i], [zj]) = 4>%{zj). 
The matrix [4>i{zj)]ij has full rank due to the condition 2 in the definition of annotation. This means the 
vector spaces V and H p (K) have same rank and hence are isomorphic. It follows that V ~ H P (K). 

2—^1: Consider a basis [zi], [Z2], • • • , [z g ] of H p (K) and a bilinear form a : H P (K) x H P {K) as 
above. Then, by standard result in topology, the matrix [<j>i{zj)]ij has full rank. This implies that the vectors 
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[(j>i(zj)]i=i t - g and [^t(«fc)]i=i ... , g are identical if and only if [zi\ = [zk]- The claim can be extended to any 
class since it can be expressed as linear combination of the basis. ■ 
In our algorithm, elementary collapses need to modify annotations in the current simplicial complex. 
The result below helps us to transfer annotations among simplices without affecting cohomology bases. 

Proposition 3.4 Let a : K p — > TL\ be a valid annotation. Let a £ K p be any p-simplex and r any of its 
(p — l)-faces. Adding a a to the annotation of all cofaces of t of codimension 1 produces a valid annotation 
for K p . Furthermore, cohomology bases remain unchanged by this change in annotation. 

Proof: Let {fa\i=i,— ,g be a cohomology basis of H P {K) corresponding to a : K p — > lP 2 as stated in 
Proposition 13.31 Let T be the set of cofaces of r of codimension 1 and 



fa(a') if a' eK p \T 

fa(a') + fa(a) ifa'eT 



By construction, <^ is the cochain that corresponds to the new annotation obtained by adding a CT to that 
of the simplices in T. We prove that <p'- is a cocycle in the class [(pi]. Therefore, {[^]}i=i,2,--- ,g is a 
cohomology basis of H' P (K). The new annotation is valid by Proposition 13.31 and the cohomology bases 
remain unchanged. 

If (f>i(o~) = 0, we have (p\ = fa, and thus [<^] = [fa] trivially. So, assume that fa(a) = 1. In this case 
fa = 1 + (pi on T and equals fa everywhere else. Consider the (p — l)-cochain <p denned by far) = 1 and 
far') = for every r' € K p ~i \ r. Then the coboundary 5<p is a p-chain that is 1 for every simplex in T 
and on other p-simplices. We can write fa i = fa-\- Sep. It follows that [fa] = [fa]. ■ 

Notice that after this local modification of annotation as specified in this proposition, we have a CT = 0, 
which facilitates the deletion of the simplex a. 



3.1 Algorithm 

We only need to show how to maintain the consistent cohomology basis for a single elementary map since 
by assumption each map f is either an elementary collapse or an elementary inclusion. The annotations 
implicitly maintain a cohomology basis whose bits are also time stamped to signify when a basis is born. 
When a basis is killed, we know its persistence by looking at its time stamp for birth. 



Elementary inclusion. Consider an elementary inclusion Ki Ki + \. Assume that Ki has a valid anno- 
tation. We describe how we obtain a valid annotation for from that of Ki after inserting the p-simplex 
a = Ki + \ \ Ki. We compute the annotation zq^ for the boundary da in Ki and take actions as follows. 

Case (i): If ag a is a zero vector, the class [da] is identity of H p -i(Ki). This means a creates a p-cycle 
in Ki + i and by duality a new p-cocycle. In this case we augment the annotations for all p-simplices by 
one bit with a time stamp i + 1, that is, an annotation [b\, 62, • • • , b g ] for a p-simplex r is updated to 
[bi, hi, ■ ■ ■ , b g , b g+ i] with the last bit time stamped i + 1 where h g+ \ = for r ^ a and h a+ i = 1 for r = a. 
Note that the bit hi (1 < i < g) of a is zero. Other annotations remain unchanged. 

Case (ii): If ag a is not a zero vector, the class of the (p — l)-cycle da is nontrivial in H p -\(Ki). Therefore, 
a kills the class of this cycle and a corresponding dual class of cocycles. We simulate it by forcing ag a to 
be zero which affects other annotations as well. Let i\, %i, ■ ■ ■ , = u be the set of indices in increasing 
order so that 6^,6^, • • • , bi k = b u are all of the nonzero bits in ag G = [61,62, ■ ■ ■ , b u , ■ ■ ■ ,b g ). The cocycle 
(p = fa x + fa 2 + ■ • ■ + (fa k = (p u ) should become a coboundary after the addition of a, which renders 

4>u = <Ph + <Pi 2 H 1" 4>ik-i- 



6 



We choose the latest cocycle fa to be dependent on others. In other words, the cocycle class [<p] born at 
the time stamp of b u is killed at time i + 1 by the addition of of a. This pairing matches that of standard 
persistence algorithm where the youngest basis is always paired first. To simulate the purport of the above 
equation, we replace the class [fa] with the class [fa + fa + • • • + fa_j] and delete the class [fa] from the 
cocycle basis. So, we add the vector ag CT to all annotations of (p — l)-simplices whose uth bit is nonzero. 
This renders the uth bit of all annotations of (p — l)-simplices to be zero. We simply delete this bit from all 
such annotations. See the case in upper row of Figure El 

Elementary collapse. The case for handling collapse is more interesting. First we state some results 

known for elementary collapse. Consider an elementary collapse Ki -4 ifj+i where the vertices (u, v) 
are collapsed to u. The following link condition was introduced in lPT2l in the context of maintaining 
homeomorphism through edge collapses. 

Definition 3.5 The pair (u, v) satisfies the link condition if edge {u, v} G Ki, and Lk uDhk v = Lk {u, v}. 

It is known that if Ki is a 2- or 3-manifold, the underlying spaces \Ki\ and are homeomorphic 

if (u, v) satisfies the link condition. Instead of homeomorphism, if we require only homotopy equivalence, 
we can relax the condition of Ki being only 2- or 3-manifolds, thanks to a result in HI . 

Proposition 3.6 ^[7J/ If (u, v) satisfies the link condition, then \K{ \ and |-FQ+i | are homotopy equivalent. 

Definition 3.7 For an elementary collapse fi : — > Ki+i, a simplex a G Ki is called vanishing if the 
cardinality of fi(o~) is one less than that of a. Two simplices a and a' are called mirror of each other if 
one contains u and the other v, and share rest of the vertices. In Figure \J\ the vanishing simplices are 
{{u, v}, {u, v, t}} and the mirror pairs are {{u}, {v}}, {{u, w}, {v, w}}, {{u, t}, {v, t}}. 

We now explain how to update the annotations 
when an elementary collapse /j : Ki — > Ki + i col- 
lapses (it, v) to u. 

Case (i): (u, v) satisfies the link condition. In 
this case /j* : H*{Ki) — > H*(Ki + \) is an iso- 
morphism by Proposition 13.61 Then, by duality 
the corresponding cocycle bases need not change. 
However, all vanishing simplices for the collapse 
fi need to be deleted, and all simplices contain- 
ing v need to be pulled to the vertex u which re- 
quires changing their annotations. We update the 
annotations in such a way that the annotations of 
all vanishing simplices become zero, and those of 
the mirror simplices become the same. We will ar- 
gue later that once this is achieved, the collapse can 
be implemented by simply deleting the vanishing 
simplices and replacing v with u in all simplices 
containing v without changing their annotations. 

We update the annotations in Ki as follows. 
Let r be any mirror simplex containing u. It has 
a unique coface, say a containing the edge {it, v}. 
We add a CT to the annotations of all cofaces of r of codimension 1 including a. By Proposition [33J the new 



00 




u 00 v u 



Figure 2: Updating annotations: 1) inserting the blue 
triangle in upper row; it kills a 1 -cycle whose anno- 
tation was 10 originally. The first bit 1 is added to the 
single edge annotation which is 1 1 and make it 01. The 
first bit is deleted from all annotations. 2) collapsing 
uv in lower row; the annotation 11 of uv is added to 
all edges containing u before the collapse. Only edge 
annotations are shown. 
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annotation generated by this process corresponds to the old cohomology basis for Ki. This new annotation 
has a a as zero since a a + a a = 0. See the case in lower row of Figure |2j 

Observe that the above process also applies to all faces of r if \t\ > 1. Then, r acquires an iden- 
tical annotation to that of its partner mirror simplex. Notice that no time stamp changes in this case. 

K fi K Case (ii): (u, v) does not satisfy the link condition. We consider the simplicies a, 

in increasing order of dimensions, whose absence from Ki makes (u, v) violate 
the link condition. For each such a, we modify the annotations of all simplices 
that we would have done if a were to be inserted. Thereafter, we carry out the 
algorithm for case (i) of elementary collapse. In essence, implicitly, we obtain 
an intermediate complex Ki = KiU {a} where the diagram on left commutes with f[ being the extension 
of fi to Ki. This means that the persistence of fi is identical to that of f[ o % which justifies our action of 
elementary inclusion followed by an elementary collapse. 

We remark that this is the only case where we may insert implicitly a simplex a in current approach. 
The number of such a is usually much smaller than the number of simplices in the cone u * St v that we 
would need to insert for the algorithm in Section |2j recall Figured] 



Justification. We justify the updates of annotations in the above four cases. In particular, we need to show 
that the annotations produced by our update algorithm for K i+ i is still valid. The correspondence among 
the cohomology bases is automatically maintained through such valid annotations due to Proposition 13.31 
which we exploit for computing persistence by time stamping. Let ai denote the annotation of a cycle z in 
Kj for j = i, i + 1. For the two cases of elementary inclusions, for space limitation, we will only provide 
the argument for Case (ii) here, and the argument for Case (i) can be found in the Appendix. 
Case (ii) of elementary inclusion: The only annotations altered are for dimensions p and p — 1. In dimension 
p the only change is the addition of a along with its zero annotation. In this case, a cannot participate in 
any p-cycle in 2£j+i because otherwise da should have zero annotation in Ki. Therefore, annotation for 
dimension p remains valid in 2£j+i. So, we focus on dimension p — 1. 

Let z and z' be two (p — l)-cycles with [z] = [z'\ in Ki + \. Observe that both z and z' are also (p — 1)- 
cycles in Ki. Recall that aQ a has been added to all (p — l)-simplices with uth bit equal to 1. Hence, the uth 
bit of any (p — 1) -cycle is exactly equal to the parity of the number of (p — 1) -simplices in it with nth bit 
equal to 1. If [z] = [z'\ in Ki, we have a\ = a*,. In particular, the uth bit of a* and a l z , are the same implying 
that ag a has been added with the same parity to a* and a*,. Therefore, a\ +l = a^t 1 - Consider the other case 
when [z] ^ [z'] in Ki. Then, there must be a p-chain D in Ki such that d{D + a) = z + z' in Ki + \. We get 
dD = z + z' + da and hence z + z' + da = in Ki. So, the annotation of the cycle z + z' + da is zero in 
Ki. Since nth bit of a l dcr is 1, it must be true that a* and a*, differ in the nth bit which means 

a 2 + a 2 , = a 2 + a z , + a a<7 = 0. 

Now suppose that [z] ^ [z'\ in Ki + \. Clearly, [z] ^ \z'\ even in Ki implying a* ^ a l z ,. If nth bits of a* 
and a*/ are the same, we will have 

3 z — 3 z + 3 dcr T a z> + a da ~ 3 z > 

proving the required. So, assume that nth bits of a* and aL are different. Without loss of generality, assume 
that uth bit of a* is 1 and that of a 1 , is 0. We claim that a\ + 3g a ^ a l z ,. Suppose not. Then, by definition 
of annotation, [z + da] = [z'\ in Ki. Since [da] = [0] in -fQ+i, we have [z + da] = [z] = [z 1 ] in ifj+i 
reaching a contradiction that [z] ^ [z'\ in Ki + \. Therefore, we have a z +1 ^ a^ 1 because 

a z — a z + a dcr T a z' — a z> ■ 
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Case (i) of elementary collapse: Let a be a p-simplex where our algorithm adds its annotation to all other 
p-simplices containing r: Recall that r is a (p — l)-face of a containing u. Adding a^ to all cofaces of r 
of codimension 1 (we refer to this as an annotation transfer) creates a new annotation which is still valid 
for Kj by Proposition 13.41 At the end of all annotation transfers for all a, we have a valid annotation for 
Ki with the same cohmology basis such that all vanishing simplices have zero annotation, and each pair of 

miiTor simplices have the same annotation (See the Appendix for the proof.) 

/ 

Observe that, under the elementary collapse Ki — > ili+i, the set of vanishing simplices are exactly those 
simplices a for which f(a) has a lower dimension than a. A pair of mirror simplices r and t' are those that 
satisfy that /(r) = /(r') (i.e, the simplex t' containing v coincides with its mirror partner r containing u). 
Hence after the elementary collapse, for any p-simplex a G Ki, if f(a) is ap-simplex, we have a CT = ay( CT ) 
by construction. We can now finish the argument that this induced annotation for Ki+i is also valid for 
Ki + \. The details are given in the Appendix. 

Case (ii) of elementary collapse: This is the combination of elementary inclusions and then elementary 
collapse. The validity of annotations for these elementary operations imply the validity in this case too. 

4 Persistence Diagram Approximation 

In this section, we apply the previous results to approximating persistence diagrams of Rips nitrations. 
Given a set of points V □, let 1Z r (V) denote the Rips complex on point set V with parameter r. That is, a 
(i-simplex a = {uo, . . . , Ud} C V is in lZ r (V) if and only if \\ui — Uj \\ < r for any i, j G [0, d]. In this 
section, we present an algorithm to approximate the persistence diagram for the following Rips filtration. 
The parameters a > and < e < 1 are assumed to be preselected. 

TZ a {V) TZ^ 1+£)a (V) ^ K {1+£)2a {V) ^ TZ il+£) ' ia (V) ^■■■^ K il+£)ma (V). (1) 

As the parameter for the Rips complex increases, the number of /c-simplices in the Rips complex can 
be Q(n k+l ) where n = \V\. This makes computing the persistent homology of the above filtration costly. 
In ifTTl . Sheehy initiated the problem of approximating the persistence diagram of the above filtration by 
another filtration where each simplicial complex involved has size only linear in n. His approach is based 
on a novel framework that allows vertices to be collapsed (deleted) when the parameter r for the Rips 
complex becomes large, which helps to keep the size of the simplicial complex at each stage small. This 
new filtration in essence is simply a filtration connected by simplicial maps. Sheehy then developed an 
elegant weighting scheme to replace the simplicial maps induced by vertex collapses with a filtration again 
connected by inclusions, so that classical algorithms such as lTT3l can be employed. 

In this section, we provide an alternative approach to approximate the persistence diagram of the fil- 
tration given in CO- We follow the same vertex-collapse framework as Sheehy, but handle the filtration 
connected by simplicial maps directly, and thus avoid the need of re-weighting. 

A filtration induced by simplicial maps. Given a set of points P, we say that Q C P is a 5-net of P 
if (i) for any point p G P, there exists a point q € Q such that \\p — q\\ < 5; and (ii) no two points in Q 
are within 5 distance. A 5-net Q can be easily constructed by a standard greedy approach by taking furthest 
points iteratively (see e.g; [9]), or by more sophisticated and efficient methods (see e.g; lfl4l ). 

Now set Vq := V. We first construct a sequence of point sets Vfc, k = 0, 1, . . . , m, such that Vfc+i is a 
^§-(1 + e) fc_1 -net of Consider the following vertex maps tt^ : Vk — > Vk+i, for k G [0, m — 1], where for 
any v G Vk, ^^{v) is the vertex in V^+i that is closest to v. Define tt^ : Vq — > V^+i as vrfc(v) = vr^o - • • -kq{v). 

'For simplicity of presentation, in this section we assume that V is sampled from TR d . Results in this section can be extended 
to the case where points in V are from any metric space with doubling dimension d. 



9 



Each vertex map Tr k induces a well-defined simplicial map h k ■ 1Z a<yl+£ ^ k {V k ) — > TZ a ( 1+£ } k+1 (V k+ i). 
Indeed, since V k +i is a ^ae 2 (l + e) fc_1 -net of V k , each edge e = {u, v} from 1Z a ( 1+£ ) (y k ) satisfies 



\fc+i 



||7r fc (u) -TT k {v)\\ < \\u-v\\ + ||u-7r(u)|| + ||«-7r(v)|| < a(l + e) k + ae 2 (l + e) k ~ L < a(l + e) 

Hence i: k {u)iT k {v) is an edge in TZ^ 1+e ^ +1 {V k+1 ). Since in a Rips complex, higher dimensional sim- 
plices are determined by the edges, every simplex {uq, . . . ,Ud} in TZ a ( 1+£ > has a well-defined image 
{7Tfc(^o) 5 . . . ,7Tk{ud)} in TZ alyl+s ^ + (Vfc + i). Hence, each h k is well-defined, and they provide the filtra- 
tion: 

K«(V Q ) TZ a ( 1+E \Vi) ±h TZ< 1+ ^(V 2 ) ^ TZ< l+e ^ (V 3 ) • • • ^ TZ a ^ m (V m ). (2) 

Our main result is that the persistence diagram of the above sequence (connected by simplicial maps) 
approximates that of £[]> (which is connected by inclusion maps). In particular, we show that the persistence 
modules induced by these two sequences interleave. First, we need maps to connect these two sequences. 
For this, we observe that the vertex map it k : Vq — > V k+ \ also induces a simplicial map h k : lZ a ^ 1+e ^ ( Vq) — > 
<j^a(i+e) k+1 (y k+1 y Specifically, we need to show that if there is an edge {it, v} in TZ a ^ l+£ ^ k {Vq), then there 
is an edge TT k (u)Tr k (v) in 7£ a ( 1+£ ) +1 (y k+ i). Indeed, setting tt-i(u) = u and 7T_i(d) = v, we have: 

\\7T k (u) - 7Tk(y)\\ < \\TT k (u) - u\\ + - v\\ + \\u - v\\ 

k k 

< H^i(n) - 7ri_i(u)|| + \\fti(v) - 7Ti-i(v)\\ + 



tt — v\ 



j=0 



i=0 



< ^(1 + e) 4 + a(l + < a(l + e) k+1 . 

1=0 



Claim 4.1 Each triangle in the following diagram commutes at the homology level. 



n^ 1+c r + (V ) 



n< 1+ ^\v k 




K a W +1 (V k+1 ) 



The above claim implies that the persistence modules induced by sequences (0Q) and © are weakly 
iog(i+e) _ m |; er i eavec i at tne log-scale. By Theorem 4.3 of 0, we thus conclude with the following: 

Proposition 4.2 The persistence diagram of the sequence (0 provides a 3 lo g^+ £ ) -approximation of the 
persistence diagram of the sequence at the log-scale. 

Finally, since 14+1 is a (5-net of V k for 5 = ^-(1 + e) k ~ l , we can show by a standard packing argument 
that each TZ a( - 1+e ^ (V k ) is of size linear in n. The detailed claim and its proof are in Appendix Putting 
everything together, we conclude with the following main result of this section. Note that the persistence 
diagram of the simplicial maps in $2} can be computed using our algorithm from Section l2~Tl or the improved 
algorithm from Section [3] 

Theorem 4.3 Given a set of n points V in a metric space with doubling-dimension d and < e < 1, we 
can 3 lQ gU+ £ ) - a pp W ximate the persistence diagram of the Rips complex filtration by that of the filtration 
(|2]). The p-skeleton of each simplicial complex involved in (O has size 0((j)°( dp ^n). 
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A Missing proofs from Section |2l 



Proof of Proposition 12.5 [ Consider the surjective simplicial map f : K —> f(K) defined as f'(a) = f(a) 
for any a E A. Writing V = V{K) and V = V(K'), we have f v = f v . The simplicial map / : A — > K' 
is a composition i o /', where i : /(A) if' is the canonical inclusion /(A) C A'. Obviously, the 
inclusion i is an elementary simplicial map, and it can be easily decomposed into a sequence of elementary 
inclusions. We now show that /' can be decomposed into a sequence of elementary collapses. 

Let A := {v E V | |/v _1 (u)l > 1}- Hence / v maps injectively onto V \ A. Order vertices in A 
arbitrarily as {v\, . . . ,Vk}, k = \A\, and let Ai denote /v _1 (nj). We now define fi and Aj in increasing 
order of i. For the base case, set A = A. For any i > 0, consider the vertex map f&. which is the injective 
map on V(Ki-i) \ Ai, but maps Ai to V{. We set /j to be the simplicial map induced by this vertex map 
fx v and set Ki := fi{Ki_i). By construction, /j is a surjective elementary simplicial map. 

It is easy to see that the vertex map f^ k o ■ • • o equals fy. Hence, the induced simplicial map 
f k o • • • o f 1 : K — > K k equals f':K—> f(K). Furthermore, each fi can be decomposed into a sequence 
of elementary collapses, each induced by a vertex map that maps only two vertices from Ai to Uj. 

Proof of Claim Hj6j For a simplex a E A' that does not contain u, f is identity on its unique pre-image; 
that is, a E K C K. Now consider a d-simplex a = {u,u±, . . . ,Ud} £ K'. Since / is surjective, there 
exists at least one pre-image of a in K of the form a' = {uo, u\, . . . , Ud}, where «o is either u or v. If it 
is no = u, we have f(a J ) = a' = a and thus a € K C K. So, assume that no = v. This means that the 
simplex {u\, . . . , Ud} is in Lk v (and thus in St v), implying that a = {u, u±, . . . , Ud} G K. 

Proof of Proposition 12.71 By definition of contiguous maps, we need to show that for any simplex a G K, 
i(a) Ui'o f(a) is a simplex in K. Note that i(cr) = a. 

First assume that the simplex a is not in St v. Since / is an elementary collapse, we have f(a) = a and 
i' o /(<t) = cr. Hence, i(<r) U (i' o f)(a) equals a which is also a simplex in K. 

Now assume that a G St v, and a = {no, • • • , Ud} U {u}. Since /(«) = n, /(cr) = {no, . . . , Ud} U {u}, 
and so is (i' o f)(a). Hence the union of i(cr) and {i! o f)(a) is B := {no, . . . , Ud} U {n, n}, which is the 
simplex a U {n}. On the other hand, by construction of K, the simplex a U {n} is necessarily in A'. Hence 
i(a) U (i ; o f)(a) is a simplex in K in this case too. Hence the maps i and i' o f are. contiguous. 

Proof of Proposition 12.81 Specifically, consider an arbitrary simplex a G K. Let X = St {u, v} be the 

star of {n, v} in K. If a £ X, then by the construction of K, a G K' . In other words, n(a) = a indeed 
exists in K' in this case. Furthermore, i' o Tr(a) = a and thus (i! o 7r)(<r) U id^-(cr) = a E K. 

Now consider the case cr G X, and assume that a = {no, ■ ■ ■ , Ud} U ^4 with A C {n, n}. To show that 
7r is well defined, we need to show that vr(cr) = {no, . . . , Ud, n}, is indeed a simplex in K' . 

(i) If u ^ A, then by the construction of K, a has a pre-image in A under the inclusion i : K ^ K. 
Hence cr must also be a simplex in K, and under the map /, it is mapped to the simplex {no, • • • , n^, n} in 
K'. As such, vr(cr) exists in A' in this case. 

(ii) If n G A, then the simplex a" = {no, . . . , n^} U (A \ {n}) must exist in the closed star St k{u, v} 
of {n, n} in A. Hence A contains a simplex cr" U {x} with x being either n or v. Under the map /, the 
image of a" U {x} in A' is {no, . . . , Ud, n}, hence 7r(cr) is well-defined in A' in this case too. 

Furthermore, in both (i) and (ii) above, (i' o ir){a) = vr(cr), and 7r(cr) is a face of the simplex a. Hence 
{i! o it) (a) U id^(a) = a G A. Putting everything together, we have that i' o it and the identity map id^ 
are contiguous. 
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B Missing Details from Section [3] 



Justification for the correctness of Case (i) of elementary inclusion. Case (i) of elementary inclusion: 
For k / p, any £;-cycle in Ki+i was a fc-cycle in Ki and the annotations for fc-simplices are not altered for 
k 7^ p. So, a valid annotation of Ki for k ^ p remains so after inclusion of a p-simplex a. Now consider 
two p-cycles z and z' in Ki + \. We need to show that a* +1 = a^t 1 if and only if [z] = [z']. 

Let [z] = [z 1 ]. If z does not include a, neither does z' and hence both exist in Ki. In this case 

a^ 1 = K,0] = [a;„0] = a^ 1 

since all p-simplices other than a gets the same zero bit appended to their annotations while going from Ki 
to Ki+\. Now consider the case where z includes a. Then, z' also includes a. There is a p + 1-chain, say 
D, so that dD = z + z' '. It follows that 3D = (z + a) + (z' + <r). The p-chains c = z + a and c' = 2/ + a 
do not include a since it gets canceled under Z^-additions. The p-cycle c + d = dD is identity in H p (Ki) 
being a boundary. Therefore, its annotation is zero in Ki giving that a c = a c / in Ki and hence in -fQ+i. It 
follows that z = c + a and z' = d + a have identical annotations in ifi+i. 

Now suppose that [z] ^ [z'\. If none of z and z' include a, they exist in Ki and by the same logic as 
above inherit the same annotations from Ki which cannot be identical because Ki's annotation is valid. If 
exactly one of z and z' includes a, the annotation of one in Ki+\ will have the last bit 1 and that of the other 
will have 0. Thus, they will not be identical. Consider the remaining case where both z and z' include a. 
Consider the cycle z + z' which cannot include a because of ^-additions. Then, the cycle z + z' exists in 
Ki and cannot be in the class [0] because otherwise [z + z'\ will remain identity in H p {Ki + \) contradicting 
[z] / [z'\ in K i+1 . Since [z + z'\ ^ [0] in H p (Ki), one has a* / a*,. It follows that 

= [al,l]/[a;„l] = a^ 1 . 

More details for justification of Case (i) of elementary collapse. In this case, (n, v) satisfies the link 
condition. The elementary collapse fi : Ki — > Ki + \ maps vertices (u, v) to u. Recall that our algorithm for 
handling Case (i) of elementary collapse first performs a set of annotation transfers: in particular, for any 
mirror simplex r, we take its unique coface a that also contains the edge {u, v }, and add the annotation 
to all the cofaces of r of codimension 1. We claim the following. 

Claim B.l After all possible annotation transfers, 

(i) each vanishing simplex (with respect to (u, v)) has a zero annotation; and 

( ii) each mirror simplex r has the same annotation as its mirror partner simplex t'. 

Proof: First, note that the vanishing simplices are exactly those simplices containing the edge {u, v}. If 
(u, v) satisfies the link condition, then for every p-simplex a containing {u, v}, exactly two among its p — 1- 
faces are mirror simplices, and all other remaining p— 1-faces are vanishing simplices. It then follows that we 
perform an annotation transfer for every vanishing simplex. Furthermore, while performing an annotation 
transfer for a vanishing simplex a, no other co-face of same dimension can be a vanishing simplex. Hence, 
the annotation of each vanishing simplex a is updated exactly once, when we perform annotation transfer 
for a. By Proposition 13.41 its annotation becomes zero after that transfer and stays zero. This proves claim 
(i) above. 

For claim (ii), consider a pair of (p — 1) -dimensional mirror simplices r = {u, U2, ■ ■ ■ , u p } and r' = 
{v, U2-, ■ ■ ■ , u p }. Since (u, v) satisfies the link condition, it is necessary that thep-simplex a = {u, v, U2, ■ ■ ■ , u 
must exist in Ki. Thus, we have that ag a = 0. On the other hand, other than r and r', any (p — l)-face of 
a is a vanishing simplex, and by Claim (i), in the end, its annotation should be zero. We thus have that after 
all annotation transfers, ag a = a r + a r / = 0, implying that a r = a r /. ■ 
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Argument that Ki+\ has valid annotation after annotation transfer and collapse. Let z and z' be any 

two p-cycles in Ki + \. Let w and w' be two p-cycles in K{ so that f(w) = z and f(w') = z' . Then, 
[w] = [w'] if and only if [z] = [z'\ since /* : H p (Ki) — > H p (Ki + \) is an isomorphism (Proposition I3.6I ). 
With the modified annotation of K{ we have a l w = a' l w , if and only if [w] = [w']. Therefore, [z] = [z'\ in 
K i+ i if and only if a l w = a* /. The only simplices where z and w differ are either vanishing simplices or 
miiTor simplices. In the first case, the annotation is zero and in the second case the annotations are the same. 
So, a l w = a* +1 . Similarly, a l w , = a l J~ . Therefore, a z = a z > if and only if [z] = [z'\ in A' i+1 . This proves 
that the annotation for -fQ+i is valid. 



C Missing Details from Section |4] 

Proof of Claim 14.11 First, we consider the bottom triangle. Note that the vertex map Tr k restricted on the 
set of vertices V k is the same as the vertex map ir k . (That is, for a vertex u G Vfc C Vq, hk(u) = 
Thus hk = hk o jk- Hence, the bottom triangle commutes both at the simplicial complex level and at the 
homology level. 

Now consider the top triangle. We claim that the map j k+ i o h k is contiguous to the inclusion map 
if.. Since two contiguous maps induce the same homomorphisms at the homology level, the top triangle 
commutes at the homology level. 

This claim can be verified by the definition of contiguous maps. Given a simplex a G 7Z a ( 1+e ^ (Vq), 
we wish to show that vertices from a U h k (a) form a simplex in TZ a ( 1+e ) + (Vq). Since 7^ a ( 1 + e ) fe+1 (v ) i s 
a Rips complex, we only need to show that for any two vertices u and v from a U h}.(a), the edge {u, v} 
has length less than a(l + e) k+1 (and thus in TZ a ( 1+£ ) +1 (Vb)). If u and v are both from a or both from 
hk(a), then obviously \\u — v\\ < a(l + e) k+1 . Otherwise, assume without loss of generality that v G a and 
u G hj.(a) where u = 7Tfc(w) for some u G a. It then follows that, 

2 k 

\\u - v\\ < \\u - u\\ + \\u - v\\ < — ^(1 + e)' + ail + e) k < a(l + e) k+1 . 

"2(l + e)^ V 

Therefore, the vertices of a U hk{a) constitute a simplex in / R, a ( 1+e ) k+1 (Vq). 

The size of n a ^ 1+£ ^ (V k ). We compute V k+1 such that it is a 5-net of V k for 5 = 3^(1 + e)^ 1 by the 
following standard greedy approach: Let D(-,-) denote the metric on the set of input points P (and thus 
Vfcs). Starting with V k +i = 0, pick an arbitrary vertex from V k and add it to V k +\. In the zth round, there 
are already i points in V k+ \. We identify the point u from V k whose minimum distance to points in V k+ \ 
is the largest. We stop when either D(u, V k+ i) < 5 or V k+ \ = V k . By construction, when this process 
terminates, any point in V k is within 5 distance to some point in Vfc+i, and no two points in V k+ \ are within 
5 distance. A naive implementation of the above procedure takes 0(n 2 ) time. One can also compute the 
5-net V k+ \ more efficiently in 0(n log n) time (see, e.g, Ifl4l ). However, we remark that this step does not 
form a bottleneck in the time complexity as computing persistence diagrams takes time cubic in the number 
of simplices. 

Proposition C.l Suppose the set of input points V are from a metric space with doubling dimension d. For 
V k s constructed as described above, the number of p-simplices in 1Z a ( l+e ) (Vfc+i) is 0((i)°( rf P) • n)for 
< e < 1. 

Proof: For simplicity, set r := a(l + e) k+1 ; note that 5 = 2 (i+ £ )5 r - We first prove that there are 0((^) ^) 
number of edges for each vertex in V k+ \. Specifically, consider a node u G V k+ \. it will be connected to 
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all other vertices in Vk+i that are within distance r to u. Since Vk+i is a 5-net of Vk, every node in Vk+i 
has a ball centered at it with radius 6/2 that is empty of other points in Vk+\- Since the points are from a 

metric space with doubling dimension d, we can pack only 0((jj2) d ) = 0({ 4 ^ 1 ^ ) d ) = 0((-^) d ) (for 

< e < 1) number of balls of radius 6/2 in a ball of radius r. This means that there are only 0((j) 2d ) 
number of edges containing u, where the big-0 notation hides terms exponential in d. It then follows that 
the number of p-simplices containing u is 0((i) 2dp ). Since there are \V k+ 1 1 < n number of vertices in 
Vfc-f-i, the total number of p-simplices is bounded by 0((^) 0( - dp ^n) as claimed. ■ 
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